Abstract

The relationships between diversity, productivity and scale determine much of the structure
and robustness of complex biological and social systems[1, 2]. While arguments for the link
between specialization and productivity are common[3, 5, 6, 7, 4], diversity has often been 
invoked as a hedging strategy[8, 9], allowing systems to evolve in response to environmental 
change[8, 9, 10]. Despite their general appeal, these arguments have not typically produced 
quantitative predictions for optimal levels of functional diversity consistent with observations. 
One important reason why these relationships have resisted formalization is the idiosyncratic 
nature of diversity measures, which depend on given classification schemes[11, 12]. Here,
we address these issues by analyzing the statistics of professions in cities and show how their 
probability distribution takes a universal scale-invariant form, common to all cities, obtained 
in the limit of infinite resolution of given taxonomies. We propose a model that generates 
the form and parameters of this distribution via the introduction of new occupations at a rate 
leading to individual specialization subject to the preservation of access to overall function 
via their ego social networks. This perspective unifies ideas about the importance of network 
structure in ecology and of innovation as a recombinatory process with economic concepts of 
productivity gains obtained through the division and coordination of labor, stimulated by scale.

A fundamental theme across many complex systems[1, 2], from ecosystems[13] to human
behavior[14] and socio-economic organization[15, 16], deals with understanding the mechanisms
by which diversity arises and is sustained. In contemporary human societies socioeconomic
diversity is associated primarily with cities[17], accounting for their role in producing new ideas and
stimulating development[17, 10, 4]. However, counterarguments have also been made, noting that
specialized cities are sometimes more productive[5, 6, 7, 4]. Familiar examples are contemporary
Silicon Valley or manufacturing cities in their heyday. Nevertheless, these questions remain
far from settled, in part because of the difficulties inherent to measuring diversity in any complex
system[11, 12].

Measures of diversity typically account for the presence, and sometimes the relative proportion[18,
19, 20], of different functional types, for example different professions or business types in cities
or nations, or different species in an ecosystem[21]. Such measures are inevitably linked to
particular classification schemes or taxonomies. To appreciate this point, consider the question: How
many different professions are there in a large city, like New York? In general, there is no
objective answer to this question, as it depends on how finely one differentiates similar functions.
Here, we show that under specific conditions a limit of infinite resolution can be obtained in a way
similar to the treatment of physical quantities close to phase transitions[22] and that, in this limit,
scheme-independent measures of diversity can emerge.

The simplest measure of diversity, $D(N)$, counts the number of distinct professions present in
a city. Fig. 1A shows $D$ for US metropolitan areas vs. their total employment, $N_e$. Because $N_e$ is,
on average, proportional to population[23], $N$, we use the two measures of scale interchangeably.
$D$ increases with $N_e$ initially and then saturates for large cities, and is well fit by

$$D(N) = d_0 \frac{(N/N_0)^{\gamma}}{1 + (N/N_0)^{\gamma}}. \tag{1}$$

Eq. 1 holds over time and for different levels of resolution, $r$, in the hierarchical occupational
classification scheme (see SI). The parameters in Eq. 1 are, in general, functions of $r$. The scale
$d_0(r)$ is the effective size of the classification scheme at resolution $r$, and $N_0(r)$ is a characteristic size
of a city at which saturation starts. The scaling exponent $\gamma$, empirically independent of $r$ (see SI),
gives the proportionality between the relative growth rate of the population and that of new occupations in the
city, in the absence of saturation.
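
The two regimes of Eq. 1 can be checked numerically. The sketch below is illustrative only: it takes Eq. 1 in the saturating form quoted in the Fig. 1 caption, with $d_0 = 686$ and $\gamma = 0.84$, while the value of `N0` and the function name `diversity` are assumptions made for the example.

```python
# Saturating diversity curve of Eq. 1 (illustrative sketch).
D_SCHEME = 686.0   # d_0, effective size of the scheme (value quoted for Fig. 1A)
GAMMA = 0.84       # scaling exponent (value quoted for Fig. 1A)
N0 = 1.48e5        # ASSUMED saturation scale, chosen only for illustration


def diversity(n, d0=D_SCHEME, n0=N0, gamma=GAMMA):
    """Distinct occupations D(N) = d0 * x / (1 + x), with x = (N / N0)**gamma."""
    x = (n / n0) ** gamma
    return d0 * x / (1.0 + x)


# Effective prefactor of the power law that holds for N << N0.
D0 = D_SCHEME / N0 ** GAMMA

# Small cities follow the pure power law D0 * N**gamma.
ratio_small = diversity(100.0) / (D0 * 100.0 ** GAMMA)

# Very large cities saturate at the scheme size d0.
ratio_large = diversity(1e12) / D_SCHEME

# A city of size N0 holds exactly half of the scheme.
half = diversity(N0)

print(ratio_small, ratio_large, half)
```

With these inputs the small-city ratio stays close to one, the large-city ratio approaches one from below, and a city of size `N0` contains half of the occupations in the scheme.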

Coarsening the hierarchical classification leads to similar saturation at each of the scheme's
sizes, $d_0(r_6)$, $d_0(r_5)$, $d_0(r_4)$, etc. (Fig. 1B). This behavior is the hallmark of a finite resolution
artefact, a phenomenon well understood in terms of finite size scaling at phase transitions[22]. The
explicit dependence of $d_0$ on $r$ means that given classification schemes are too coarse to capture
the professional diversity of large US cities[24], beyond $N_0$ ~ 10^. Nevertheless, we can use the
variation of the statistics of occupations with $r$ to derive classification-scheme-independent results.
We reconcile all curves for $D(N)$ at different $r$ and extract their limit as $r \to \infty$. We define a
dimensionless function $h(N/N_0, \gamma)$ such that

$$D(N) = d_0(r) \left( \frac{N}{N_0} \right)^{\gamma} h\!\left( \frac{N}{N_0}, \gamma \right) \to
\begin{cases} D_0 N^{\gamma}, & N \ll N_0, \\ d_0(r), & N \gg N_0, \end{cases}$$

where $D_0$ is a constant. Comparison with Eq. 1 tells us that in the limit $N/N_0 \to 0$, $h \to 1$ and $D_0 \to d_0/N_0^{\gamma}$,
and in the limit $N/N_0 \to \infty$, $h \to (N_0/N)^{\gamma}$. A universal scaling regime exists if and only if the quantity
$D_0 = d_0(r)\,N_0(r)^{-\gamma}$ becomes a constant, independent of $r$, as $r \to \infty$ (Fig. 1B). Fig. 1C shows $d_0$
vs. $N_0^{\gamma}$ across $r$ and over time. The relationship is well described by a straight line with slope
$D_0 = 0.05$ across all years. These results suggest the existence of a resolution-independent, scale-invariant
limit for $D(N)$ and show that the occupational diversity of cities is in fact open-ended:
the number of distinct occupations in US cities increases by ~ 85% with each doubling of their labor
force, meaning that larger cities are at once more diverse in absolute terms and more specialized
per capita. These insights can be proven as a simple theorem (see SI).
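
The finite-resolution argument can be illustrated with a small numerical sketch. It assumes the scaling function implied by Eq. 1, $h(x) = 1/(1 + x^{\gamma})$ with $x = N/N_0$, and fixes $N_0(r)$ through $D_0 = d_0(r) N_0(r)^{-\gamma}$. The scheme sizes in `scheme_sizes` are hypothetical and serve only to show that curves for different resolutions agree for small cities and saturate at their own $d_0$ for large ones.

```python
GAMMA = 0.84   # exponent quoted for Fig. 1A
D0 = 0.05      # slope quoted for Fig. 1C


def n0_for(d0):
    """Saturation scale implied by D0 = d0 * N0**(-gamma)."""
    return (d0 / D0) ** (1.0 / GAMMA)


def h(x):
    """Scaling function implied by Eq. 1, with x = N / N0."""
    return 1.0 / (1.0 + x ** GAMMA)


def diversity(n, d0):
    """D(N) = d0 * (N / N0)**gamma * h(N / N0) for a scheme of size d0."""
    x = n / n0_for(d0)
    return d0 * x ** GAMMA * h(x)


# HYPOTHETICAL effective scheme sizes, from coarse to fine resolution.
scheme_sizes = [100.0, 400.0, 686.0]

# Small cities: every resolution gives nearly the same D0 * N**gamma.
small_city = [diversity(10.0, d0) for d0 in scheme_sizes]

# Very large cities: each curve saturates at its own scheme size.
large_city = [diversity(1e12, d0) for d0 in scheme_sizes]

print(small_city, large_city)
```

Only the saturation level depends on the scheme, which is why the limit of infinite resolution leaves the open-ended power law $D_0 N^{\gamma}$.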
Beyond analyzing the presence or absence of professions, which gives only a crude measure
of urban diversity, we can characterize their frequency distribution. The analysis of the frequency
of different types in complex systems, from word frequency to city size, is naturally described in
terms of their (Zipfian) rank-frequency distribution. To derive this distribution we identify $D(N)$
with the maximum rank at each value of $N$, which has probability $p(D) = 1/N$. Inverting this
relation and generalizing it to all ranks, $i$, leads to the occupational frequency, $f(i)$:

$$f(i) = \frac{N_e}{N_0} \left( \frac{d_0 - i}{i} \right)^{1/\gamma}. \tag{2}$$

This is also scheme independent in the large resolution limit and can be used to derive the
probability density, $p(i)$, as

which is also independent of $r$. The occupational probability has a residual dependence on $N$
through $D(N)$ because the rarest professions cannot have fewer than one person. This is the only
source of city size dependence of traditional measures of diversity such as the Herfindahl-Hirschman
index or the Shannon entropy (see SI)[18, 19, 21], which are functionals of $p(i)$. Given Eq. 3, both
measures express increases in diversity (the Herfindahl-Hirschman index decreases, the Shannon
entropy increases) towards a finite limit at infinite $N$. For large cities the approach to this limit is
controlled by a term $\sim N^{-\delta}$, with $\delta = 1 - \gamma$ (see SI for derivation).
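
As a consistency check on the rank-frequency argument, the sketch below evaluates the form of $f(i)$ that follows from inverting Eq. 1 under $p(D) = 1/N$, namely $f(i) = (N_e/N_0)\,[(d_0 - i)/i]^{1/\gamma}$. Ranks are treated as continuous, the value of `N0` is an assumption, and the function names are illustrative.

```python
import math

D_SCHEME = 686.0   # d_0 quoted for Fig. 1A
GAMMA = 0.84       # exponent quoted for Fig. 1A
N0 = 1.48e5        # ASSUMED saturation scale, for illustration only


def diversity(n):
    """Eq. 1: number of distinct occupations in a city with n workers."""
    x = (n / N0) ** GAMMA
    return D_SCHEME * x / (1.0 + x)


def frequency(i, n):
    """Expected number of workers in the occupation of rank i (continuous rank)."""
    return (n / N0) * ((D_SCHEME - i) / i) ** (1.0 / GAMMA)


n_city = 1.0e6

# The occupation at the maximum rank D(N) should employ about one person.
rarest = frequency(diversity(n_city), n_city)

# The most common occupation and the local log-log slope at small ranks.
top = frequency(1.0, n_city)
slope = math.log(frequency(20.0, n_city) / frequency(10.0, n_city)) / math.log(2.0)

print(rarest, top, slope)
```

The rarest occupation present employs one worker by construction, and at small ranks the log-log slope is close to $-1/\gamma$, the Zipf-like regime in the absence of saturation.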

Fig. 2 shows that the distribution of occupations for different cities is universal: when adjusted
for scale, $N_e$, all frequency curves collapse onto a single line. This shows that there is an expected
nested sequence of occupations, predicted by city size, as expected by the hierarchy principle of
central place theory[25, 20] and in analogy to products vs. level of economic development at the
national level[15, 16].

A simple model that predicts the form of the occupational diversity distribution, Eq. 3, is a
version of the Yule-Simon mechanism of preferential attachment[27, 26]: as the city grows by one
more job, $\Delta N_e = 1$, it creates a new occupation with probability $\alpha = dD/dN_e = \gamma D_0 N_e^{\gamma - 1}$, or it takes
up an existing profession, proportionally to its frequency, with probability $1 - \alpha$. For large $N_e$ this
predicts an exponent[27, 26] in the occupational distribution of 7 = i-^jv ) d(jv ) ' ^^^^ ^^^^ ^^^^ 
for small $N_e$, $\alpha < 0.04 \ll 1$.
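
The growth rule described above can be simulated directly. The following is a minimal sketch, not the authors' code: it assumes the rate $\alpha(N_e) = \gamma D_0 N_e^{\gamma - 1}$ with $\gamma = 0.84$ and $D_0 = 0.05$, and the function name `simulate_city`, the seed and the city size are choices made for the example.

```python
import random

GAMMA = 0.84   # exponent quoted for Fig. 1A
D0 = 0.05      # slope quoted for Fig. 1C


def simulate_city(n_jobs, seed=0):
    """Grow a city one job at a time and return the size of each occupation."""
    rng = random.Random(seed)
    sizes = [1]     # the first job necessarily founds the first occupation
    holders = [0]   # occupation index held by every existing job
    for n in range(1, n_jobs):
        alpha = GAMMA * D0 * n ** (GAMMA - 1.0)   # chance of a new occupation
        if rng.random() < alpha:
            sizes.append(1)
            holders.append(len(sizes) - 1)
        else:
            # A uniformly random existing job selects its occupation with
            # probability proportional to that occupation's frequency.
            occupation = holders[rng.randrange(n)]
            sizes[occupation] += 1
            holders.append(occupation)
    return sizes


sizes = simulate_city(100_000)
n_occupations = len(sizes)
expected = D0 * 100_000 ** GAMMA

print(n_occupations, round(expected))
```

Because the expected number of new occupations is the sum of $\alpha$ over all added jobs, the simulated count should track $D_0 N_e^{\gamma}$ up to sampling fluctuations.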

Given the results so far, we may expect economic productivity to be inversely proportional to
professional diversity. Consider that indicators of economic productivity (wages, GDP) scale, on
average, superlinearly[23] with $N$, $W(N,t) = W_0(t) N(t)^{\beta}$, with $W_0(t)$ and $\beta \simeq 1 + \delta > 1$
independent of $N$ (see SI). An average wage per capita is, then, $w(N) = W_0 N^{\delta}$, where $\delta \simeq 1/6 \simeq
1 - \gamma$. This result has been derived from a general theoretical framework that defines cities as
co-located social networks, subject to infrastructural efficiency constraints[28], with $w = G\,k(N)$,
where $G$ is a constant in $N$, involving a balance between people and infrastructural properties, and
$k(N) = k_0 N^{\delta}$ is the average social connectivity (degree) per person, which has been observed
in urban telecommunication networks[29]. Similarly, diversity per capita is $d(N) = D(N)/N =
D_0 N^{\gamma - 1} = D_0 N^{-\delta}$. Hence, we conclude that $w \sim 1/d$. This relation is an expression of the
abundant evidence in economics for specialization (a decrease in $d(N)$) as the source of increases
in productivity[3, 7]. However, no city has become rich by reducing its occupational diversity to a
single activity: what then is the optimal level of diversity that maximizes the economic
productivity of a city?

To answer this question we observe that the process of specialization, by which an individual
sheds tasks to others, requires that such functions remain tightly integrated so that overall
functionality is preserved. This implements a form of comparative advantage at the individual level, where
increases in productivity at each node, gained through specialization, remain integrated with other
necessary functions via social network links. We may therefore require that the number of
functions directly accessible to each person is preserved as the city grows. The number of functions
that each individual reaches directly through its social network is $N_f = d \cdot k$, which we require
to stay constant, $A$, in $N$: $d \cdot k = A$. We now reconcile the expectation that $w \sim 1/d$ with the
claim that, like other urban socioeconomic outputs, $w$ be proportional to social connectivity[28],
$w \propto k$. We write $w = g(kd)/d$, with $g$ an analytic function, independent of $N$. We now maximize
wages subject to the conservation of functionality across social links by a Lagrange multiplier
procedure (see SI) to show that $d = A/k$ and that $w(N) = g(A)/d(N) = [g(A)/A]\,k(N)$. Then,
$D(N) = [A/k(N)]\,N = (A/k_0)\,N^{1-\delta} = D_0 N^{\gamma}$, which predicts the form of the scaling of
occupational diversity with city size. As shown above, this relation, taken across all $N$, also predicts the
rank-size distribution of urban occupations and associated measures of diversity.
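
The chain of scaling relations in this argument can be verified with a few lines of arithmetic. In the sketch below the values of `K0` and `A` are hypothetical (chosen so that $A/k_0 = 0.05$), $g(A)$ is set to one in arbitrary units, and the function names are illustrative.

```python
DELTA = 1.0 / 6.0     # connectivity exponent quoted in the text
GAMMA = 1.0 - DELTA   # implied diversity exponent
K0 = 10.0             # HYPOTHETICAL baseline connectivity k_0
A = 0.5               # HYPOTHETICAL number of functions reachable per person


def connectivity(n):
    """Average social connectivity k(N) = k0 * N**delta."""
    return K0 * n ** DELTA


def diversity_per_capita(n):
    """Conservation of accessible functions, d * k = A, gives d = A / k."""
    return A / connectivity(n)


def total_diversity(n):
    """D(N) = d(N) * N."""
    return diversity_per_capita(n) * n


def wage(n, g_of_a=1.0):
    """w = g(A) / d, with g(A) constant in N (arbitrary units)."""
    return g_of_a / diversity_per_capita(n)


n = 1.0e6
prefactor = total_diversity(n) / n ** GAMMA   # should equal D0 = A / K0
wage_gain = wage(2 * n) / wage(n)             # should equal 2**DELTA

print(prefactor, wage_gain)
```

Under these assumptions total diversity grows as $N^{\gamma}$ with prefactor $A/k_0$, while the per capita wage grows as $N^{\delta}$, so that $w \sim 1/d$ and $w \propto k$ hold at the same time.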

In summary, we showed that the patterns of occupational diversity and economic productivity
observed in US metropolitan areas can be derived from an integrated view of cities as
socioeconomic networks that promote a systematic division and coordination of labor without loss of
overall functionalities available to individuals. Similar quantitative patterns characterize the
technological complexity of simpler human societies[30] and may be a property of networked systems
that can experience open-ended increases in their productivity with scale. The reversibility of these
processes, e.g. the existence of hysteresis in the externalization and reabsorption of functions by
individuals and networks, may also underlie the resilience of many complex systems[13, 12, 9]
under unexpected functional change or population loss.
Fig. 1. The number of distinct occupations in US Metropolitan Statistical Areas vs. total
employment. (A) The relationship between the number of professions present for each city (orange
dots) and city size is well described by $D(N_e) = d_0 \frac{(N_e/N_0)^{\gamma}}{1 + (N_e/N_0)^{\gamma}}$, with $d_0 = 686$, $\gamma = 0.84$,
$N_0$ = 1.48 × 10^ (blue line). (B) $D(N_e)$ at different levels of resolution of the occupational
classification scheme, $r_i$, with $i = 6$ the finest and $i = 3$ the coarsest. (C) $d_0$ is proportional to
$N_0^{\gamma}$ across levels of classification scheme resolution and time, suggesting that there is an $r$-independent
limit to the form of the occupational diversity of cities and that $D$ is open-ended. In this limit,
$D(N_e) = D_0 N_e^{\gamma}$ and larger cities are always more diverse as a whole, but more specialized per
capita.
Fig. 2. The distribution of occupations in US metropolitan areas is universal. (A) Frequency
distributions for several cities with different population sizes differ only in their amplitude, which is
set by city size and the extent to which they probe rare occupations. The horizontal grey line shows
the minimum number of professions (thirty) reported. (B) The rank-probability distributions for
different cities collapse on each other when adjusted for city size (total employment). The yellow
line shows the fit of the universal form to $f(i)/N_e = \frac{1}{N_0} \left( \frac{d_0 - i}{i + i_0} \right)^{1/\gamma}$, where we introduce a
scale $i_0 = 3$ at small ranks. The black line is the form of $f(i)/N_e$ in the absence of saturation.
Supporting Information Text

Data We adopt a definition of functional cities in terms of metropolitan statistical areas (MSAs).
MSAs are collections of political units (counties) aggregated by the US Census Bureau based on a
set of criteria that includes population size, density and commuting flows. There were about 403
MSAs in 2010, providing an ample basis for studying occupational diversity across city size (with
50K to 20M inhabitants) as well as any other urban characteristics. MSAs are integrated labor
markets and the best official definition of functional cities in terms of a mixing population[1].

Data on professional occupations in US MSAs were obtained from the Bureau of Labor
Statistics (BLS)[2] and are freely available online (http://www.bls.gov/oes/). All occupations in the U.S.
economy are hierarchically classified on the basis of their similarities at different levels of
aggregation based on the BLS's Standard Occupational Classification (SOC) scheme, which contains a
total of 811 distinct professions at its finest ($r = r_6$, or 6-digit) level of resolution for 2010.
Fit methodology For each year, we estimated the parameters of Eq. 1 at resolution $r_i$, for $i = 6$, 5,
4, and 3 digit level occupations (Fig. 1A-B), through ordinary least squares[3], using the
Gauss-Newton method, which relies on linear approximations to the nonlinear mean function[4]. We then
used the $\gamma$, $d_0$, $N_0$ parameters estimated for 2010 to fit Eq. 2. A small constant $i_0 = 3$ is introduced to
Eq. 2 to account for the initial curvature observed for the most common occupations (Fig. 2B). Hence
the fit in Fig. 2B corresponds to

$$\frac{f(i)}{N_e} = \frac{1}{N_0} \left( \frac{d_0 - i}{i + i_0} \right)^{1/\gamma}. \tag{S.1}$$

Note that $i_0$ is not determined by the process of analytic continuation, valid only at high ranks, that
we used to obtain the form of the frequency distribution, and constitutes from that point of view a
functional freedom that is also motivated by the Yule-Simon model[5].
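
To make the Gauss-Newton step concrete, here is a minimal sketch of a damped Gauss-Newton fit of Eq. 1 on synthetic, noise-free data. It is not the authors' procedure: for numerical conditioning it fits $\ln D$ with parameters $(\ln d_0, \ln N_0, \gamma)$ and adds step halving, both of which are choices made for this example. The "true" value of $N_0$, the starting point and all function names are assumptions.

```python
import math


def log_sigmoid(u):
    """Numerically stable ln[x / (1 + x)] with x = exp(u)."""
    return -math.log1p(math.exp(-u)) if u >= 0 else u - math.log1p(math.exp(u))


def one_minus_sigmoid(u):
    """Stable 1 / (1 + exp(u)), the derivative of log_sigmoid."""
    if u >= 0:
        e = math.exp(-u)
        return e / (1.0 + e)
    return 1.0 / (1.0 + math.exp(u))


def log_model(theta, log_n):
    """ln D(N) for Eq. 1, with theta = (ln d0, ln N0, gamma)."""
    ln_d0, ln_n0, gamma = theta
    return ln_d0 + log_sigmoid(gamma * (log_n - ln_n0))


def jacobian_row(theta, log_n):
    """Derivatives of log_model with respect to (ln d0, ln N0, gamma)."""
    _, ln_n0, gamma = theta
    slope = one_minus_sigmoid(gamma * (log_n - ln_n0))
    return [1.0, -gamma * slope, (log_n - ln_n0) * slope]


def solve(matrix, rhs):
    """Gaussian elimination with partial pivoting for a small dense system."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - tail) / aug[r][r]
    return x


def sum_sq(theta, points):
    """Sum of squared residuals in ln D."""
    return sum((log_d - log_model(theta, log_n)) ** 2 for log_n, log_d in points)


def gauss_newton(points, theta, iterations=50):
    """Damped Gauss-Newton on (ln N, ln D) pairs."""
    for _ in range(iterations):
        jtj = [[0.0] * 3 for _ in range(3)]
        jtr = [0.0] * 3
        for log_n, log_d in points:
            row = jacobian_row(theta, log_n)
            resid = log_d - log_model(theta, log_n)
            for p in range(3):
                jtr[p] += row[p] * resid
                for q in range(3):
                    jtj[p][q] += row[p] * row[q]
        step = solve(jtj, jtr)              # linearized least-squares update
        current, scale = sum_sq(theta, points), 1.0
        while scale > 1e-8:                 # halve until the fit does not worsen
            trial = [t + scale * s for t, s in zip(theta, step)]
            if sum_sq(trial, points) <= current:
                theta = trial
                break
            scale *= 0.5
    return theta


# Synthetic data from Eq. 1; the N0 used here is an ASSUMED value.
TRUE_D0, TRUE_N0, TRUE_GAMMA = 686.0, 1.48e5, 0.84
sizes = [2.0e4 * 10 ** (3.0 * j / 29.0) for j in range(30)]
points = []
for n in sizes:
    x = (n / TRUE_N0) ** TRUE_GAMMA
    points.append((math.log(n), math.log(TRUE_D0 * x / (1.0 + x))))

fit = gauss_newton(points, [math.log(500.0), math.log(1.0e5), 1.0])
d0_fit, n0_fit, gamma_fit = math.exp(fit[0]), math.exp(fit[1]), fit[2]
print(d0_fit, n0_fit, gamma_fit)
```

Each iteration linearizes the model around the current parameters and solves the resulting normal equations, which is the linear approximation to the nonlinear mean function referred to above.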

Asymptotic relation between diversity and city size holds over time 

The relation between the diversity of employment and the size of an urban area is maintained across time.
The diversity of professions in the urban area is well described by the relation:

$$d(N) = d_0 \frac{(N/N_0)^{\gamma}}{1 + (N/N_0)^{\gamma}}, \tag{S.2}$$

where $d_0$ is the effective size of the classification scheme (maximum number of occupations), $N_0$
is the size of the city where saturation starts to occur and $\gamma$ is the characteristic exponent describing
how $d$ increases with city size, in the absence of saturation.
Figure S.1: The functional relationship between diversity of professions, $d$, and city size holds
over time.

Variation of $\gamma$ over time

The value of $\gamma$ is fairly stable over time. The last employment classification scheme was defined in
2002, which may explain the larger fluctuations (and lower values) of $\gamma$ prior to that date.

Figure S.2: Changes in the best fit value of the exponent $\gamma$ across time.

Indices of diversity

Diversity is most commonly measured in terms of functionals of the probability distribution of
types, $p(i)$. Examples of such functionals are the Herfindahl-Hirschman index (HH) and the Shannon
entropy (S).

The Herfindahl-Hirschman index (HH) measures how concentrated a distribution is. For this
reason it is often applied to economic sectors to measure their concentration and the creation of
monopolies. Given the asymptotic form of the distribution (Eq. 6) it can be calculated analytically
as 



Consequently the HH index decreases towards a small constant, set by the exponent $\delta$, as cities
grow. This expresses an increase in diversity with city size. Note that the asymptotic value for
$N \to \infty$, with $\delta \simeq 1/6$, is $HH \to 0.028$, which is typical of highly diverse (and competitive)
markets.

Similarly the Shannon entropy S measures the diversity of the occupational distribution as 



which increases with N towards the Pareto distribution limit at infinite N. Thus, the increase in 
entropy signals the increase in diversity of the occupational distribution as cities grow. Note that 
in both cases the increases in diversity are driven, at leading order, by a term of order $N^{-\delta}$.
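
The behavior of both indices can be illustrated under one simple assumption: that the rank density is a continuous Pareto form, $p(i) \propto i^{-1/\gamma}$ on $1 \le i \le D(N)$ with $D(N) = D_0 N^{\gamma}$. This form is assumed for the sketch and is not taken from the text. The standard definitions $HH = \int p^2\,di$ and $S = -\int p \ln p\,di$ are used, and the value $D_0 = 0.05$ is the slope quoted for Fig. 1C.

```python
import math

DELTA = 1.0 / 6.0
GAMMA = 1.0 - DELTA
D0 = 0.05                 # slope quoted for Fig. 1C
A_EXP = 1.0 / GAMMA       # rank exponent of the ASSUMED Pareto form


def max_rank(n):
    """Open-ended diversity D(N) = D0 * N**gamma."""
    return D0 * n ** GAMMA


def normalization(d):
    """Integral of i**(-a) over 1 <= i <= d."""
    return (1.0 - d ** (1.0 - A_EXP)) / (A_EXP - 1.0)


def hh_index(n):
    """HH = integral of p(i)**2 for the truncated Pareto density."""
    d = max_rank(n)
    second = (1.0 - d ** (1.0 - 2.0 * A_EXP)) / (2.0 * A_EXP - 1.0)
    return second / normalization(d) ** 2


def entropy(n):
    """S = -integral of p ln p = ln(norm) + a * E[ln i]."""
    d = max_rank(n)
    norm = normalization(d)
    log_moment = (1.0 - d ** (1.0 - A_EXP) * (1.0 + (A_EXP - 1.0) * math.log(d)))
    log_moment /= (A_EXP - 1.0) ** 2
    return math.log(norm) + A_EXP * log_moment / norm


# Limits at infinite N under the same assumption.
HH_LIMIT = DELTA ** 2 / (1.0 - DELTA ** 2)
ENTROPY_LIMIT = -math.log(A_EXP - 1.0) + A_EXP / (A_EXP - 1.0)


def hh_gap(n):
    """Distance of HH from its infinite-N limit."""
    return hh_index(n) - HH_LIMIT


print(hh_index(1e4), hh_index(1e7), HH_LIMIT)
print(entropy(1e4), entropy(1e7), ENTROPY_LIMIT)
```

With $\delta = 1/6$ this assumed form gives an HH limit of $\delta^2/(1 - \delta^2) = 1/35$, close to the asymptotic value quoted above, and the gap to the limit shrinks in proportion to $N^{-\delta}$ for large $N$.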

Larger cities are more diverse, but less diverse per capita 
Theorem 

The number of distinct professions scales sublinearly with city population size:
For any not fully specialized city with $N$ inhabitants (or workers), if professions are a property of
individuals that cannot be accumulated and the number of distinct professions $d$ is a scale invariant
function of $N$, then its exponent $\gamma < 1$ (sublinear scaling).

Figure S.3: Measures of occupation diversity in U.S. urban systems: (a) Herfindahl-Hirschman
index; (b) Shannon entropy. Both plots show the results of evaluating these indices at $r = r_6$, thus
showing saturation in these measures of diversity for large cities.

Proof:

We take a profession as a property of a person that cannot be accumulated. As such, the total
number of distinct professions in a city, $d$, satisfies

$$\frac{d}{N} \le 1. \tag{S.5}$$

From this it follows that

$$\ln d \le \ln N \;\Rightarrow\; \frac{\Delta d}{d} \le \frac{\Delta N}{N}, \tag{S.6}$$

where $\Delta$ denotes a variation. Because $d$ is a scale invariant function of $N$, $d = D_0 N^{\gamma}$, then

$$\frac{\Delta d}{d} = \gamma \frac{\Delta N}{N}, \tag{S.7}$$

for some real number $\gamma$, independent of $N$. Then, from (S.6) and (S.7), it follows that $\gamma \le 1$.
Because no city is fully specialized, the inequality holds strictly and $\gamma < 1$, as stated.

Productivity and mean annual wages 

We evaluated the superlinear relationship between city size and productivity, measured as the mean
annual wages of each occupation in the dataset provided by the BLS. Although these data miss a
few rare professions because of reporting cutoffs due to confidentiality issues, we find a
superlinear scaling relation with exponent $\beta = 1 + \delta$, $\delta = 0.18 \pm 0.03$, in agreement with theoretical
expectations of $\delta \simeq 1/6$[1].
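
A scaling exponent of this kind is commonly estimated as the slope of a straight line in log-log coordinates. The sketch below shows that estimate on synthetic, noise-free data. The prefactor `W0`, the city sizes and the function name `loglog_slope` are hypothetical, and no claim is made that this reproduces the estimation procedure used for the reported value.

```python
import math


def loglog_slope(sizes, totals):
    """Least-squares line through (ln N, ln W); returns (exponent, prefactor)."""
    xs = [math.log(n) for n in sizes]
    ys = [math.log(w) for w in totals]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, math.exp(mean_y - slope * mean_x)


# Synthetic cities from 50K to 12.8M inhabitants (HYPOTHETICAL data).
W0 = 2.0e4                                   # hypothetical wage prefactor
sizes = [5.0e4 * 2 ** j for j in range(9)]
totals = [W0 * n ** 1.18 for n in sizes]     # total wages W = W0 * N**beta

beta, prefactor = loglog_slope(sizes, totals)
print(beta, prefactor)
```

With real data the residual scatter around the fitted line determines the uncertainty on the exponent.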
Figure S.4: Total wages in US MSAs in 2009 scale superlinearly with city population size, with an
exponent $\beta = 1.18 \pm 0.03$.

Optimal Professional Diversity 

Here we show in greater detail how the observed open-ended increase in labor productivity with
city size (see previous section and Ref. [1]) is related to the characterization of the changes in the
levels of professional diversity reported in the main text.

The main idea is that specialization as the source of increases in economic productivity must be
accompanied by coordination of specialized tasks in order to maintain the original functionality.
Thus, if an individual's tasks become more specialized, then the complementary functions shed
in the process of specialization must be preserved within its socio-economic network, so that the
overall function now exists in a social organization and less within the individual. We assume that
only close coordination (at least in the initial stages of transfer of functions) will be able to keep
these specialized functions suitably integrated, and as such that there must be a conservation of the
number of functions within the immediate contacts (first neighbors) of each individual. Thus,
on average, we write this condition as $kd = A$, where $A$ is a constant in $N$, but
that may vary over time, e.g. due to changes in communication and transportation technologies.
This conservation may also be able to maintain the ability of each individual to innovate through
processes of functional combinations that shift functions from his own immediate responsibility to
those of others with whom he is closely connected.

The other main difference to Ref. [1] is to start from the standard assumption that economic
productivity is proportional to (labor) specialization, that is $w \sim 1/d$, and not necessarily to social
connectivity. To preserve dimensions we need to write instead that $w = g(kd)/d$, where $g$ is a
function that transforms the index of specialization $1/d$ into economic units (money per person
and unit time). Its dependence on $kd$ is necessary for consistency with observations and may give
interesting insights on mechanisms of economic growth over time, which however we will not
address here.

Thus, we can formulate the problem of determining the optimal professional diversity of a
city with size $N$ (and consequently average per capita connectivity[1, 6] $k = k_0 N^{\delta}$) in terms of
maximizing its economic productivity subject to the constraint that activities lost to an individual
remain available through its neighbors in their social network. This can be written in terms of a
standard Lagrange multiplier optimization problem,

$$\mathcal{L}(d, \lambda) = \frac{g(kd)}{d} - \lambda\,(kd - A). \tag{S.8}$$

This optimization is a particular case of the problem considered in Ref. [1], where it was shown
that the socioeconomic outputs of cities (including measures of economic output, such as wages,
GDP, etc.) are proportional, on average, to their social connectivity, $k$. From this process we
take $k(N)$ as given and show how social connectivity and professional diversity are related. The
solution of this optimization problem, obtained by setting the variations of $\mathcal{L}$ relative to each of the
variables, $d$ and $\lambda$, to zero, is
$$d = \frac{A}{k}, \tag{S.9}$$

$$g(A) = \left[ C + \int \lambda_1(A)\, dA \right] A, \tag{S.10}$$



where $C$ is a constant of integration; $\lambda_1 = \lambda/k$ and is assumed to be a function of $A$. Both are
to be set by boundary conditions on $g(A)$. Identifying this solution with the empirical relations
derived in the main text leads to $D_0 = A/k_0$. This shows that maximizing productivity subject to
the maintenance of functionality among immediate network neighbors leads to a prediction for
the professional diversity of cities in agreement with the findings of the main text, and shows
explicitly how scale, productivity, functional diversity and network structure must be integrated in
understanding the evolution of urban economies with population size.
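
For readers who want the intermediate steps, the stationarity conditions can be worked out as follows. This is a sketch under the assumption that the objective is the per capita wage $w = g(kd)/d$ with the constraint $kd = A$ enforced by a multiplier $\lambda$; the notation $\lambda_1 = \lambda/k$ follows the text, and treating $A$ as a variable in the last integration step mirrors the statement that $\lambda_1$ is a function of $A$.

```latex
% Assumed objective: wages g(kd)/d, with the constraint kd = A.
\begin{align}
\mathcal{L}(d,\lambda) &= \frac{g(kd)}{d} - \lambda\,(kd - A), \\
\frac{\partial \mathcal{L}}{\partial \lambda} = 0
  &\;\Rightarrow\; kd = A \;\Rightarrow\; d = \frac{A}{k}, \\
\frac{\partial \mathcal{L}}{\partial d} = 0
  &\;\Rightarrow\; \frac{k\,g'(kd)}{d} - \frac{g(kd)}{d^{2}} - \lambda k = 0 .
\end{align}
% Substituting d = A/k and dividing by k^2, with \lambda_1 = \lambda / k:
\begin{align}
\frac{g'(A)}{A} - \frac{g(A)}{A^{2}} = \lambda_1
  &\;\Rightarrow\; \frac{\mathrm{d}}{\mathrm{d}A}\!\left[\frac{g(A)}{A}\right] = \lambda_1(A)
  \;\Rightarrow\; g(A) = \left[ C + \int \lambda_1(A)\,\mathrm{d}A \right] A, \\
D(N) = d\,N = \frac{A}{k_0}\,N^{1-\delta}
  &\;\Rightarrow\; D_0 = \frac{A}{k_0}, \qquad \gamma = 1 - \delta .
\end{align}
```

The last line uses $k = k_0 N^{\delta}$ and recovers the sublinear scaling of total diversity with city size.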